319 research outputs found

    Limit distributions for large P\'{o}lya urns

    Get PDF
    We consider a two-color P\'{o}lya urn in the case when a fixed number SS of balls is added at each step. Assume it is a large urn that is, the second eigenvalue mm of the replacement matrix satisfies 1/2<m/S≤11/2<m/S\leq1. After nn drawings, the composition vector has asymptotically a first deterministic term of order nn and a second random term of order nm/Sn^{m/S}. The object of interest is the limit distribution of this random term. The method consists in embedding the discrete-time urn in continuous time, getting a two-type branching process. The dislocation equations associated with this process lead to a system of two differential equations satisfied by the Fourier transforms of the limit distributions. The resolution is carried out and it turns out that the Fourier transforms are explicitly related to Abelian integrals over the Fermat curve of degree mm. The limit laws appear to constitute a new family of probability densities supported by the whole real line.Comment: Published in at http://dx.doi.org/10.1214/10-AAP696 the Annals of Applied Probability (http://www.imstat.org/aap/) by the Institute of Mathematical Statistics (http://www.imstat.org

    Support and density of the limit mm-ary search trees distribution

    Get PDF
    The space requirements of an mm-ary search tree satisfies a well-known phase transition: when m≤26m\leq 26, the second order asymptotics is Gaussian. When m≥27m\geq 27, it is not Gaussian any longer and a limit WW of a complex-valued martingale arises. We show that the distribution of WW has a square integrable density on the complex plane, that its support is the whole complex plane, and that it has finite exponential moments. The proofs are based on the study of the distributional equation W\egalLoi\sum_{k=1}^mV_k^{\lambda}W_k, where V1,...,VmV_1, ..., V_m are the spacings of (m−1)(m-1) independent random variables uniformly distributed on [0,1][0,1], W1,...,WmW_1, ..., W_m are independent copies of W which are also independent of (V1,...,Vm)(V_1, ..., V_m) and λ\lambda is a complex number

    Smoothing equations for large P\'olya urns

    Full text link
    Consider a balanced non triangular two-color P\'olya-Eggenberger urn process, assumed to be large which means that the ratio sigma of the replacement matrix eigenvalues satisfies 1/2<sigma <1. The composition vector of both discrete time and continuous time models admits a drift which is carried by the principal direction of the replacement matrix. In the second principal direction, this random vector admits also an almost sure asymptotics and a real-valued limit random variable arises, named WDT in discrete time and WCT in continous time. The paper deals with the distributions of both W. Appearing as martingale limits, known to be nonnormal, these laws remain up to now rather mysterious. Exploiting the underlying tree structure of the urn process, we show that WDT and WCT are the unique solutions of two distributional systems in some suitable spaces of integrable probability measures. These systems are natural extensions of distributional equations that already appeared in famous algorithmical problems like Quicksort analysis. Existence and unicity of the solutions of the systems are obtained by means of contracting smoothing transforms. Via the equation systems, we find upperbounds for the moments of WDT and WCT and we show that the laws of WDT and WCT are moment-determined. We also prove that WDT is supported by the whole real line and admits a continuous density (WCT was already known to have a density, infinitely differentiable on R\{0} and not bounded at the origin)

    Digital search trees and chaos game representation

    Get PDF
    In this paper, we consider a possible representation of a DNA sequence in a quaternary tree, in which on can visualize repetitions of subwords. The CGR-tree turns a sequence of letters into a digital search tree (DST), obtained from the suffixes of the reversed sequence. Several results are known concerning the height and the insertion depth for DST built from i.i.d. successive sequences. Here, the successive inserted wors are strongly dependent. We give the asymptotic behaviour of the insertion depth and of the length of branches for the CGR-tree obtained from the suffixes of reversed i.i.d. or Markovian sequence. This behaviour turns out to be at first order the same one as in the case of independent words. As a by-product, asymptotic results on the length of longest runs in a Markovian sequence are obtained

    Variable length Markov chains and dynamical sources

    Full text link
    Infinite random sequences of letters can be viewed as stochastic chains or as strings produced by a source, in the sense of information theory. The relationship between Variable Length Markov Chains (VLMC) and probabilistic dynamical sources is studied. We establish a probabilistic frame for context trees and VLMC and we prove that any VLMC is a dynamical source for which we explicitly build the mapping. On two examples, the ``comb'' and the ``bamboo blossom'', we find a necessary and sufficient condition for the existence and the unicity of a stationary probability measure for the VLMC. These two examples are detailed in order to provide the associated Dirichlet series as well as the generating functions of word occurrences.Comment: 45 pages, 15 figure

    Martingales and Profile of Binary Search Trees

    Full text link
    We are interested in the asymptotic analysis of the binary search tree (BST) under the random permutation model. Via an embedding in a continuous time model, we get new results, in particular the asymptotic behavior of the profile

    B-urns

    Full text link
    The fringe of a B-tree with parameter mm is considered as a particular P\'olya urn with mm colors. More precisely, the asymptotic behaviour of this fringe, when the number of stored keys tends to infinity, is studied through the composition vector of the fringe nodes. We establish its typical behaviour together with the fluctuations around it. The well known phase transition in P\'olya urns has the following effect on B-trees: for m≤59m\leq 59, the fluctuations are asymptotically Gaussian, though for m≥60m\geq 60, the composition vector is oscillating; after scaling, the fluctuations of such an urn strongly converge to a random variable WW. This limit is C\mathbb C-valued and it does not seem to follow any classical law. Several properties of WW are shown: existence of exponential moments, characterization of its distribution as the solution of a smoothing equation, existence of a density relatively to the Lebesgue measure on C\mathbb C, support of WW. Moreover, a few representations of the composition vector for various values of mm illustrate the different kinds of convergence

    Growing conditioned trees

    Get PDF
    AbstractFor a Markovian branching particle system in Rd a Palm type distribution on the genealogical trees up to a time horizon t is computed, which generically (i.e. if there are almost surely no multiplicities in the particle positions at time t) can be viewed as a conditional distribution on the trees given that the particle system at time t populates a certain site. The result is obtained in two different ways: by conditioning on the first branching and by means of Kallenberg's method of backward trees

    Persistent random walks, variable length Markov chains and piecewise deterministic Markov processes *

    Get PDF
    International audienceA classical random walk (S t , t ∈ N) is defined by S t := t n=0 X n , where (X n) are i.i.d. When the increments (X n) n∈N are a one-order Markov chain, a short memory is introduced in the dynamics of (S t). This so-called " persistent " random walk is nolonger Markovian and, under suitable conditions, the rescaled process converges towards the integrated telegraph noise (ITN) as the timescale and space-scale parameters tend to zero (see [11, 17, 18]). The ITN process is effectively non-Markovian too. The aim is to consider persistent random walks (S t) whose increments are Markov chains with variable order which can be infinite. This variable memory is enlighted by a one-to-one correspondence between (X n) and a suitable Variable Length Markov Chain (VLMC), since for a VLMC the dependency from the past can be unbounded. The key fact is to consider the non Markovian letter process (X n) as the margin of a couple (X n , M n) n≥0 where (M n) n≥0 stands for the memory of the process (X n). We prove that, under a suitable rescaling, (S n , X n , M n) converges in distribution towards a time continuous process (S 0 (t), X(t), M (t)). The process (S 0 (t)) is a semi-Markov and Piecewise Deterministic Markov Process whose paths are piecewise linear
    • …
    corecore